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Description 

Related Applications ' 

5 This application claims the benefit of U S Provisional Application No 60/016 532 filed May 3. 1 996 

Background of the Invention 

The generation or destruction of tissue requires constant reorganization and restructuring of the extracellular matrix 
} o fECM) components including interstitial collagens basement membrane collagen, fibronectm lammin aggrecan and 
various proteoglycans Heinegard and Oldberg FASEBJ 1989 3 2C42-2051 Woessner FASEBJ 1991 5 214-2154. 
Normal types of remodeling processes include embryonic development post-partum involution of the uterus, ovulation, 
wound healing, and bone and growth plate remodeling Woessner et al Steroids 1989 54. 491-499: Weeks et al 
Biochim Btophys Acta 1976 445. 205-214 Lepage and Gache EMBO J 1990 9. 3003-3012. Wride and Sanders Dev- 
is Oyn. 1993 1 93(3) 225-39 Similar processes also occur in disease states such as pint destruction in rheumatoid and 
osteoarthritis, periodontia and tumor cell metastasis. Thompson and Oegema J Bone Joint Surg. 1979. 61 407-16; 
Reynolds et al Adv-Dent-Res 1 994. 8(2) 31 2-9 One example of these processes is the migration of macrophages to 
the site of inflammation as in the case of synovial tissue in rheumatoid arthritis Cutolo et al Clin, and Exper Rheum. 
1993. 11 331-339. The ECM components are regulated, in both normal and disease states, by various exogenous 
20 and endogenous factors For example, in tumor formation the differentiation state of the cell can increase the rate of 
degradation of the ECM Benya Pathol Immunopathol Res. 1988 7 51-54 Likewise, the presence of metalioprotei- 
nases or their inhibitors can alter the composition of the ECM An imbalance of metalloprotemases and tissue inhibitors 
of matrix metalloprotemases (TIMP) has been shown to contribute to the pathogenesis of osteoarthritis Dean et al. J. 
Clin. Invest. 1989. 54 678-635 Cytokines, growth factors, and the extracellular environment can all contribute to the 
25 alteration of the ECM Tyler BiochemJ. 1985 227 369-37S: Dinarello Sem Immunol 1992 4. 1 33-1 45. McConnell et 
al J Cell Biol. 1987. 105. 1067-96 

The growth of cartilage and bone is actualized by cells such as articular chondrocytes and osteoblasts The main 
function of these cells in immature tissue is the deposition and remodeling of the cartilage or bone matrix In adult 
t:ssue. these cel!s maintain this matrix in order to ensure its proper function In both cases, this encompasses secretion 
30 of the extracellular components as we! 1 as secretion of proteins involved in the turnover of the ECM 

A major species of protein secreted by these cells and involved in the turnover of the ECM are the metalloprotei- 
nases Woessner FASEBJ 1991 5. 214-2154 A new type of secretory glycoprotein has also been identified in human 
cartilage osteoblasts synovial ceils sheep and bovine oviduct and mammary cells and macrophages Nynkos and 
Golds Biochem J 1990 263. 265-265 Hakala et al J Biol Chem 1993. 263(34) 25303-25310 Johansen et al. J 
3S Bone and Mm Res. 1992 7(5) 501-511: Rejman and Hurley Biochem. Btophys. Res Commun 1988. 150 329-334: 
DeSouza and Murray Endocrinology 1995 136(6) 2435-2496 Hollak et al J Clin. Invest. 1994. 93. 1233-92: Anas et 
al. Biol, of Reproduction ^94. 51 655-694 These novel mamma'ian proteins all share regions of significant homology 
to the bacterial and fungal chitmases and therefore are referred to herein as "chitmase-hke" proteins Chttmases are 
enzymes that hydrolyze glycosidic bonds They bear a subtle similarity to iysozymes from mammals and function as 
endoglycos^dases with a specificity for N-acetyl-glucosamme linkages However, these types of chitin-like structures, 
homopoiymers of N-acetyl-glucosamine are not normally encountered in mammalian tissue. 

The human cartilage glycoprotein. HC gp-39 is a protein with an apparent molecular weight of approximately 39 
kDa secreted by both articular choncrocytes anc synovial fibroblasts Nynkos and Golds Biochem J 1990 263. 
265-263 Hakaia et al J Biol Chem 1993 263(34) 25303-25310 This protein has been described as a marker for 
^ joint injury appearing in the blood and synovial fluid from patients diagnosed with rheumatoid arthritis Johansen et al 
British J of Rheumatology 1993 32 949-955 The gene encoding this protein has been cloned and is expressed 
specifically in cartilage and synovial cells of rheumatic joints Hakala et al J Biol. Chem. 1993 263(34) 25303-25310 
T he prctein YKl-40 has also been identified as one of the major secretory products of cultured human osteoblastic 
cells (osteocarcinoma cell Ime MG-63i expressed in response to 1 25-dihydroxyvitamm D3 stimulation Johansen et 
so al J Bone ana Mm. Res 1992.7(5). 501-511 johansen et al Br J Rheumatol 1993 32 949-55 The N-termmal 
portion of YKL-40 was sequenced and found to be identical to HC gp-39 Upon further sequencing. YKL-40 and HC 
gp-3- were lound to be identical. 

Chitotnosidase is an enzyme which has been identified as a member of this "chitinase-like" family Renkema et 
al J Biol Chem. 1995 27C 2198-2202 Hollak et al J Clin Invest. 1994. 93. 1238-92 This protein also has an 
55 apparent molecular weight of 39 kDa and shares N-terminal homologies with HC gp-39 the bovine mammary protein 
and several bacterial chitmases Activity of this enzyme was originally detected from cells of patients afflicted with 
Gaucher Disease (GD) Gaucher Disease is an mhentea deficiency m the activity of glucocerebrostdase a lysosomal 
hydrolase This defect results in an accumulation of glucosylceramide (glucocerebroside) in the lysosomes of macro- 
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phages Accumulation of lipid-iaden macrophages results in hepatosplenomegaly bone lesions and neurological 
anomalies After morphological differentiation of monocytes into macrophages in culture the cells begin to produce 
and secrete increasing amounts of chitotriosidase This increase is on average 600 times greater in GD patients than 
in patients with other pathological conditions The elevation in chitotriosidase activity can be effectively reduced, how- 

5 ever, upon initiation of enzyme supplementation therapy Unlike the other members of the chitmase-like family, chitot- 
riosidase has chitolytic activity Like the bacterial enzyme, it has the ability to degrade chitin azure a polymer of beta- 
1 -4-linked N-acetylglucosamme moieties. 

A new lymphocyte-associated protein of the chitmase-like family, referred to as HC gp-39L. has now been identified 
HC-gp39L protien is believed to be tnvoivea in tissue remodeling in the mammalian cell and thus serve as useful tools 

>o m the development of therapeutics and diagnostics for tissue remodeling disorders 

Summary of the Invention 

In accordance with one aspect of the present invention there is provided a novel mature polypeptide which is HC 
-5 gp-39L as well as biologically active and dtagnostically or therapeutically useful fragments, analogs and derivatives 
thereof The polypeptide of the present invention is of human origin 

In accordance with another aspect of the present invention, there are provided isolated nucleic acid molecules 
encoding human HC gp-39L including mRNAs. DNAs cDNAs. genomic DNAs as well as analogs and biologically 
active and diagnosticaily or therapeutically useful fragments thereof. 
20 In accordance with yet a further aspect of the present invention there is provided a process for producing HC gp- 

39L by recomoinant techniques comprising culturmg recombinant prokaryotic and/or eukaryotic host cells containing 
a human nucleic acid sequence of HC gp-39L under conditions promoting expression of these proteins and subsequent 
recovery of these proteins 

In accordance with yet a further aspect of the present invention, there is provided a process for utilizing HC gp- 
25 39L polypeptide, for therapeutic purposes, for example in the treatment of tissue remodeling disorders 

In accordance with yet a further aspect of the present invention there is provided a process for targeting HC gp- 
39L located on the surface of lymphocytes with specific antibodies, for therapeutic purposes, such as the treatment of 
autoimmune or tissue remodeling disorders. 

In accordance with yet a further aspect of the present invention there is provided a process for utilizing HC gp- 
30 39L polypeptide for diagnostic purposes for example in the diagnosis of tissue remodeling disorders 

These and other asoects of the present invention should be apparent to those skilled in the art from the teachings 
herein 

Brief Description of the Drawings 

35 

Figure 1 provides the polynucleotide coding sequence (SEQ ID NO 1 ) and deduced amino acid sequence (SEQ 
ID NO 2) of the polypeptide HC gp-39L 

Detailed Description of the Invention 

40 

In accordance with one aspect of the present invention, there is provided an isolated nucleic acid sequence (SEQ 
ID NO I ) which encodes the chitmase-like protein HC gp-39L (HC gp-39 like) The deduced ammo acid sequence of 
this protein (SEQ ID NO 2) is shown in Figure 1 Further. HC gp-39L has been isolated from lymphocytes and has 
been found to share 51% amino acid identity (6S°o ammo acid similarity) with the chitmase-like protein HC gp-39 
-5 including the conserved chitmase-like domains and cysteine motif However HC gp-39L differs from the other chitinase- 
like proteins in that the cDNA extenas past the homology with HC gp-39 and chitotriosidase producing an N-terminal 
extension 

The nucieic acid sequences of the present invention may be in the form of RNA or in the form of DNA which DNA 
includes cDNA. genomic DNA and synthetic DNA The DNA may be double-stranded or single-stranded, and if single 
50 stranded may be the coding strand or non-coding (anti-sense) strand The coding sequence which encodes HC gp- 
39L protein may be identical to the coding sequence shown in Figure 1 or may be a different coding sequence which 
coding sequence, as a result of the redundancy or degeneracy of the genetic code encodes the same protein as the 
DNA of Figure 1 

The present invention includes polynucleotides encoding the same mature polypeptide as shown in Figure 1 
^ Further the inventions includes variants of such polynucleotides that encode a fragment derivative or analog of the 
polypeptides of Figure 1 Among variants in this regard are variants that differ from the aforementioned polynucleotides 
by nucleotide substitutions deletions or additions The substitutions, deletions or additions may involve one or more 
nucleotides The variants may be altered in coding or non-coding regions or both Alterations in the coding regions 
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may produce conservative amino acid substitutions deletions or additions 

Variants of the invention may have a sequence that occurs tn nature or they may have a sequence that does not 
occur naturally As herein above indicated, the polynucleotide may nave a coding sequence which is a naturally oc- 
curring allelic variant of the coding sequences shown in Figure 1 As known in the art an allelic variant is an alternate 

5 form of a polynucleotide sequences which may have a substitution deletion or addition of one or more nucleotides 
Among the particularly preferred embodiments of the inventions in this regard are polynucleotides encoding 
polypeptides having the amino acid sequences of HC gp-39L set out in Figure V variants analogs derivatives and 
fragments thereof and fragments of the variants, analogs and derivatives 

Further particularly preferred in this regard are polynucleotides encoding HC gp-39L variants anaiogs derivatives 

■0 and fragments and variants analogs and derivatives of the fragments, which have the amino acid sequence of HC 
gp-39L polypeptide of Figure 1 in which several, a few 5 to 10 1 to 5 1 to 3 ; 2 1 or no amino acid residues are 
suDsn*.uted. deleted or added in any combmaton Especially preferred among these are silent substitutions additions 
and deletions, which do not alter the properties and activities of HC gp-39L Also especially preferred in this regard 
are conservative substitutions most highly preferred are polypeptides having the amino ac;d sequence of Figure 1 

75 without substitutions 

Further preferred embodiments of the invention are polynucleotides that are more than 65% identical to a polynu- 
cleotide encoding HC gp-39L polypeptide having the amino acid sequence set out in Figure 1 . or variants close ho- 
mologs. derivatives and analogs thereof as described above Alternatively, most highly preferred are polynucleotides 
that comprise a region that is more than 35% identical to a polynucleotide encoding HC gp-39L polypeptide of Figure 

20 1. in tms regard, polynucleotides more than 90% identical to the same are particularly preferred, and among these 
particularly preferred polynucleotides those with 95% or more identity are especially preferred Furthermore, those 
with 97% or more identity are highly preferred among those with 95% or more identity and among these those with 
98% or more and 99% or more identity are particularly highly preferred, with 99% or more being the more preferred 
Also particularly preferred in this regard are polynucleotides encoding a polypeptide having the amino acid se- 

25 quence of HC gp-39L set out in Figure 1 As set out elsewhere herein, the polynucleotide may encode the polypeptide 
I a continuous region or in a plurality of two or more discontinuous exons. and it may comprise additional regions as 
well which are unrelated to the coding region or regions. 

Most highly preferred in this regard are polynucleotides that comprise a region that is more than 35% identical to 
tne HC gp-39L-encodmg portion of the polynucelotide set out in Figure 1 Alternatively, most highly preferred are poly- 

30 nucleotides that comprise a region that is more than 85% identical to the HC gp-39L-encodmg portion set forth in Figure 
1 Among such polynucleotides those more than 90% identical to the same are particularly preferred and among 
these particularly preferred polynucleotides those with 95% or more identity are especially preferred Furthermore, 
those with 97% or more identity are highly preferred among those witn 95% or more identity, and among these those 
with 93% or more and 99% or more identity are particularly highly preferred with 99% or more being the more preferred 

25 of these 

The present invention also includes polynucleotides in which the sequence encoding the mature polypeptide is 
fused in the same reading frame to additional sequences Such sequences include signal sequences, which facilitate 
transport of the nascent protein into the endoplasmic reticulum, pro-sequences that are associated with inactive pre- 
cursor - forms of the polypeptide which may facilitate trafficking of the protein in a cell or out of a cell or may improve 

-0 pers-stence of the protein in a cell or in an extracellular compartment Such sequences also may be added to facilitate 
production and purification or to add additional functional domains as discussed elsewhere herein. 

A further preferred embodiment is an HC gp-39L polynucleotide sequence comprising a Met codon upstream (5') 
of the 5' sequence in SEQ ID 1 It is most preferred that the Met codon be part of a continuing open reading frame as 
tnat encoding the polypeptide of Figure 1 (SEQ ID 2). Using the sequence of HC gp-39L of SEQ ID NO 1 skilled artisans 

4 o can obtain such upstream or other upstream sequences by making primers to perform amplification using routine 
methods Preferred primers useful to derive upstream sequences are obtained from the sequence 

_ n 5'-cagcacctgtggctggggagcccagatgaagtgtggctctatcttgtaTGTGAG 

CACACCCACATTTTCACTGCCATTATCTGGGACAGCagaaccaggmggctcaacagat-3 , 

Fragments of this 'sequence are also useful, preferably fragments of about 15-50 nucleotides in length nnd most 
ss preferably 20-25 nucleotides in length 

The present invention further relates to the HC gp-39L polypeptide which has the deduced ammo acid sequence 
fSE3 ID NO 2) of Figure 1 as welt as fragments analogs and derivatives of such polypeptide 

The terms "fragment " "derivative" and "analog" when referring to the polypeptide of Figure 1 means a polypeptide 
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which retains essentially the same biological function or activity as such polypeptide Thus an analog includes a pro- 
protein which carfbe activated by cieavage of the proprotein portion to produce an active mature polypeotide The 
polypeptide of the presgnt invention may bea recombinant polypeptide a natural polypeptide ora synthetic polypeptide 
preferably a recombinant polypeptide 

$ The fragment derivative or analog of the polypeptide of Figure 1 may be (0 one in which one or more of the ammo 

acid residues are substituted with a conserved or non-conserved ammo acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code or (iij one 
in which one or more of the amino acid residues includes a substituent group or (in) one in which the mature polypeptide 
is fused with another compound such as a compound to increase the half-life of the polypeptide (for example, poly- 

} o ethylene glycol), or (iv) one in which the additional amino acids are fused to the mature polypeptide such as a leader 
or secretory sequence or a sequence which is employed for purification of the mature polypeptide or a proprotein 
sequence Such fragments, derivatives ana analogs are deemed to be within the scope of those skilled in the art from 
the teachings herein 

The polypeptide and nucleic acid sequences of the present invention are preferably provided in an isolated form. 

15 and preferaDly are purified to homogeneity The term "isolated" means that the material is removed from its original 
environment (e.g.. the natural environment if it is naturally occurring) For example a naturally-occurring nucleic acid 
sequence or polypeptide present in a living animal is not isolated, but the same nucleic acid sequence or polypeptice. 
separated from some or all of the coexisting materials in the natural system, is isolated Such nucleic acid sequences 
could be part of a vector and/or such nucleic acid sequences or polypeptides could be part of a composition and still 

20 be isolated in that such vector or composition is not part of its natural environment 

The present invention also relates to vectors which include one of the nucleic acid sequences of the present 
invention, host cells which are genetically engineered with vectors of the invention and the production of polypeptides 
of the invention by recombinant techniques 

Host cells are genetically engineered (transduced or transformed or transfectedt with the vectors of this invention 

25 which may be, for example, a cloning vector or an expression vector The vector may be for example in the form of 
a plasmid. a viral particle, a phage, etc The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters selecting transformants or amplifying HC gp-39L gene The culture 
conditions, such as temperature. pH and the like, are those previously used with the host cell selected for expression, 
gnd will be apparent to the ordinarily skilled artisan. 

JO The nucleic acid sequences of the present invention may be employed for producing polypeptides by recombinant 

techniques The nucleic acid sequences may be included in any one of a variety of expression vectors for exoressmg 
a ooiypeptide Such vectors include chromosomal, nonchromosoma! and synthetic DNA sequences e g derivatives 
of SV40 bacterial plasmids phage DNA bacuiovirus. yeast plasmids vectors derived from combinations of plasmids 
and phage DNA viral DNA sucn as vaccinia, adenovirus, fowl pox virus, and pseudorabies However any other vector 

35 may be used as long as it is repltcable and viable in the host. 

The DNA sequence of HC gp-39L can be inserted into the vector by a variety of procedures In general the DNA 
sequence is inserted into an appropriate restriction endonuclease site(s) by procedures known in the art Such proce- 
dures and others are deemed to be within the scope of those skilled in the art. 

Tne DNA sequence in the expression vector is operatively linked to an appropriate expression control sequence 

J o (si (promoter) to direct mRNA synthesis As representative examples of such promoters, there may be mentioned 
L~R or SV40 promoter, the E coil lac or trp. the phage lambda P L promoter and other promoters known to control 
expression of genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also contains a nbosome 
binding site for translation initiation and a transcription terminator The vector may also include appropriate sequences 
for amplifying expression 

^ 5 In addition the expression vectors preferably contain one or more selectable marker genes to provide a phenotypic 

trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell 
culture or such as tetracycline or ampicillm resistance in E colt 

The vector containing the appropriate DNA sequence as hereinabove described as well as an appropriate p r o- 
moter or control sequence may be employed to transform an appropriate host to permit :he host to express the protein 

50 As representative examples of appropriate hosts, there may be mentioned; bacterial cells such as E coii. Strep- 

tomyces. Salmonella typhimunum; fungal ceils such as yeast insect cells such as Drosophila S2and Spodoptera Sf9. 
animal cells such as CHO CCS or Bowes melanoma adenoviruses, plant ceils, etc The selection of an aporoonate 
host is deemed to be within the scope of those skilled in the art from the teachings herein 

More particularly, the present invention aiso includes recombinant constructs comprising one or more of the se- 

~5 quoncos as described above The constructs comprise a vector such as a plasmid or viral vector into which a sequence 
encoding HC gp-39L has been inserted, in a forward or reverse orientation in a preferred aspect of this embodiment 
the construct further comprises regulatory sequences including for example a promoter operably linked to the se- 
quence Large numbers of suitable vectors and promoters are known to those of skill in tne art and are commercially 
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available The fofjpwing vectors are provided by way of example Bacterial pQE70 pGE60 pGE-Q 'Qiagen) pBS 
pD^O Dhagescript psiX174 pbluescnpt SK. pbsks pNHSA pNHl6a pNHISA pNH46A (Stratagene) ptrc99a 
pK.K223-3 DKK233-3 pDR540 pRIT5 (Pharmacia) Eukaryotic pWLNEO pSV2CAT pOG44 pXTl pSG ( Stratagene) 
pSVK3 pBPV pMSG pSVL (Pharmacia) However any other plasmid or vector may be used as long as they are 
rephcable and viable in the host 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other 
vectors with selectable markers Two appropriate vectors are pKK232-3 and pCM7 Particular named bacterial pro- 
moters include iacl iacZ T3 T7 gpt lambda P R P L and trp Eukaryotic promoters include CMV immediate early HSV 
thymidine kinase, early and late SV40 LTRs from retrovirus and mouse metallothionem-l Selection of the appropriate 
vector and promoter :s well within the level of ordinary skill in the art 

in a furtner emoodimen! the oresent invention relates to host cells containing the above-described constructs 
The p DSt cell can be a higher euKarvotic cell, such as a mammalian cell or a lower eukaryotic ceil sucn as a yeast 
cell or the host ce i: can be a prokaiyotic cell, such as a bacterial cell Introduction of the construct into the host ceil 
can be effected by calcium phosohate transfection DEAE-Dextran mediated transfection or electroporation (Davis 
L . Dibner M Battey < . Basic Methods in Molecular Biology, (1986)) 

Tne constructs in host ceils can be used in a conventional manner to produce the gene product encoded by the 
recombinant sequence Alternatively polypeptides can be synthetically produced by convent'onal peptide synthesizers 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appro- 
priate promoters Cell-free translation systems can also be employed to produce such proteins using PNAs derived 
from the DNA constructs of the present invention Appropriate cloning and expression vectors for use with prokaryotic 
and eukaryotic hosts are described by Sambrook et al.. Molecular Cloning A Laboratory Manual, Second Edition Cold 
Spring harbor. N Y. (1969). the disclosure of which is hereby incorporated by reference 

Transcription ot the DNA encoding a orotem by higher eukaryotes is increased by inserting an enhancer sequence 
into the vector. Enhancers are cts-actmg elements of DNA. usually about from 10 to 300 bp that act on a promoter to 
increase its transcription Examples including the SV4C enhancer on the late side of the replication origin bp 100 to 
270 a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side ot the replication origin, and 
aaenov.rus enhancers 

Generally, recombinant expression vectors will mc ! ude origins of replication and selectable markers permitting 
transformation of the host cell e g the ampicilltn resistance gene of E coli and S cerevisiae TRP1 gene and a 
promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence Sucn 
promoters can be cenved from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK) a- 
factor acid phosphatase, or neat shock proteins among others. The heterologous structural sequence is assembled 
in appropriate ohase with translation initiation and termination sequences and preferably a leader sequence capable 
of directing sec r etr:n of translated protein into the penplasmic space or extracellular medium Optionally the hetero 1 - 
ogous sequence can encode a fusion protein including an N-terminal identification peptide imparting desired charac- 
teristics e g stabilization or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a structura 1 DNA sequence encoding a 
desired protein together with suitable translation initiation and termination signals in operaole reading phase with a 
functional oromoter The vector will comprise one or more phenotypic selectaole markers and an origin M replication 
to ensure maintenance of the vector and to. tf aesirabie provide amplification within the host. Suitable prokaryotic 
hosts for transformation include E celt. Bacillus subtilis. Salmonella typhimunum and various species with'n the genera 
Pseudomonas. Streptomyces. and Staphylococcus, although others may also be employed as a matter of choice 

As a representative but noniimiting example, useful expression vectors for bacterial use can comprise a selectable 
marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements 
of the well known cloning vector pBR322 (ATCC 37017} Such commercial vectors include, for example. pKK223-3 
(Pnarmacia Fine Chemicals Uppsala Sweden) and GEM1 (Promega Biotec. Madison. Wl. USA) These pBR322 
"backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed 

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density the 
selected promoter is inducea by appropriate means (e g temperature shift or chemical induction) and cells are cultured 
fo r an additional period 

Cells are typically harvested oy centnfugation disrupted by physical or chemical means, and the resulting cruae 
extract retained for further purification 

Microbial cells employed m expression of proteins can be disrupted by any convenient method including freeze- 
thaw cycling sonication mechanical disruption, or use of cell lysmg agents such methods are welt know to these 
skilled m the art 

Various mammalian cell culture systems can also be employed to express recombinant proteins Examples of 
mammalian expression systems include the COS-7 lines of monkey ktdney fibroblasts described by Giuzman Cell 
1951 23 1 75. and other cell lines capable of expressing a compatible vector for example, the C127 3T3 CHO HeLa 



EP 0 805 206 A2 



and BHK ceil lines Mammalian expression ve:tors will comprise an origin of replication a suitable promoter and en- 
nancer and also~any necessary nbosome binding sites polyadenylation site splice donor and acceptor sites tran- 
scriptional termination sequences and 5' flankng nontranscnbed sequences DNA sequences derived from the SV40 
splice and polyadenylation sites may De used to proviae the required nontranscnbed genetic elements 

5 Polypeptides can then be recovered and purified from recombinant cell cultures by methods including ammonium 

sulfate or ethanol precipitation acid extraction anion or cation exchange chromatography phosphocellulose chroma- 
tography hydrophobic interaction chromatography affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Protein refolding steps can be used as necessary, in completing configuration of the mature protein 
Finally high performance liquid chromatography (HPlCj can be employed for final purification steps. 

■0 it is now believed thai the chitinase-lt*e proteins particularly HC gp-39L are involved in tissue remodeling HC gp- 

39 HC gp-39L and human chitctnosiase are all homologous to microbial chitmases in particular all three proteins 
share a similar cysteine motif and homology to tne active site of microbial chitmases Accordingly, these proteins can 
be used in the development of treatments for tissue remodeling diseases and in the diagnosis of these diseases 
3ased on the homology with HC gp-39 H3 gp-39L is oeheved to be a product of macrophages Primary human 

1 -5 monocytes activated to become macrophages through adherence to p;astic secrete high levels of HC gp-39 into the 
culture media. In addition the induction of HC gp-39 expression has been correlated with the differentiation of myeloid 
cell lines HL60 and U937 toward a macrophage lineage by induction with phorbol ester HC gp-39 message can be 
detected in human athe r osclerotic plaques delved from endarterectomies Accordingly it is believed that HC gp-39 
functions m various tissues undergoing extensive remodeling, including those associated with rheumatoid arthritis and 

20 atherosclerosis where macrophages play an important role Further rHC gp-39 was shown to stimulate in vitro smooth 
muscle cell migration, indicative of involvement in the remodeling processes occurring in diseased arteries. 

The tissue-specific expression pattern of HC gp-39L has alsobeen determined HC gp-39L is expressed specifically 
in lymphocyte cell lines which is consistent with tissues from which HC gp-39L is found In addition, human multiple 
tissue Northern blots show that HC gp-39l message is found at the highest levels in thymus and spleen which also 

25 consistent with lymphocyte-specific expression HC gp-39L protein was detected by western blot using polyclonal 
antiserum multiple produced from 'abbits immunized with a fusion construct containing a partial HC gp-39L protein 
HC gp-39L protein is associated with the membrane fraction of the cells and is not secreted into the media. Accordingly 
,t is believed that HC gp-39L functions as a cell surface lymphocyte marker for activated lymphocytes HC gp-39L is 
therefore a useful target for inflammatory or tissue remodeling diseases involving T-cell or B-cell activation such as 

oo rheumatoid arthritis 

HC gp-39L proteins can also be used in the production of antibodies 'ne proteins their fragments or other deriv- 
atives, or analogs thereof or cells expressing tnem can be used as an immunogen to produce antibodies thereto 
These antibodies can be for example polyclonal or monoclonal antibodies. The present invention also includes chi- 
meric single chain, and humanized antibodies as wel' as Fab fragments, or the product ot an Fab expression library. 
35 Various procedures Known in the art may oe used for the proauction of such antibodies and fragments 

Antibodies generated against these proteins can be obtained by direct injection of the protein into an animal 
preferably a nonhuman. The antibody so obtained will then bind the protein itself In this manner even a sequence 
encoding only a fragment of the protein can be used to generate antibodies binding the whole native protein Such 
antibodies can then oe used to isolate these protems from tissue expressing these proteins 

For preparation of monoclonal antibodies any technique which orovides antibodies produced by continuous cell 
line cultures can be used Examples include the hybnaoma technique (Kohier and Mil stein Nature 1975 256 495-497) 
the trioma techniaue. the human 3-cell hybridcma tecnnique ■ Kozbor et al Immunology Today 1983. 4. 72). and the 
EBV-hybr.doma technique to produce human monoclone' antiboaies (Cole e: al in Monoclonal Antibodies and Cancer 
Therapy Alan R Liss Inc 1955 pp 77-96) 

Techniques described tor the production :>f sir gle chain antibodies fU S Patent 4 946 773) can be adapted to 
produce single chain antibodies to immunogenic HC :jp-39L protein product Also transgenic mice may be used to 
express humanized antibodies to immunogenic HC gp-39L protein products 

This invention is also relatec to the use of HC gp-39L as a diagnostic Detection of a mutated forms of HC gp-39L 
will allow a diagnosis of a tissue remodeling disease sucn as rheumatoid ar.hntis or atherosclerosis. 
50 Individuals carrying mutations in one or more of these HC gp-39L proteins may be detected at the DNA level by 

a variety of techniques Nucleic acids for diagnosis may be obtained from a patient's cells such as from blood urine 
saliva tissue biopsy and autoDsy material The genomic DNA may be used directly for detection or may be amplified 
enzymatically by using PCR iSatki et al . Nature 1986 324. 163-1 56 1 prior to analysis RNA or cDNA may also be used 
for the same purpose As an examoie PCR pr;me r s complementary to the nucleic acid encoding HC gp-39L can be 
55 used to identify and analyze mutations m 'his protein P or example, deletions and insertions can be detected by a 
change in size of the amplified product m comparison to the normal genotype Point mutations can be identified by 
hybridizing amplified DNA to radiolabeled RNA or alternatively, radiolabeled antisense DNA sequences Perfectly 
matched sequences can be distinguished from mismatched duplexes by RNase A digestion or by differences tn melting 
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temperatures _ 

Sequence differences between the reference gene and genes having mutations may be revealed by the direct 
DNA sequencing method In addition cioned DNA segments may be employed as probes to aetect specific DNA 
segments The sensitivity of ihis method is greatly enhanced when combined with PCR For example a sequencing 

$ primer is used with double-stranded PCR product or a single-stranded template molecule generated by a modified 
PCR The sequence determination is performed by conventional procedures with radiolabeled nucleotide or by auto- 
matic sequencing procedures with fluorescent-tags. 

Genetic testing based on DNA sequence differences may be achieved by detection of alteration in electrophoretic 
mobility of DNA fragments in gels with or without denaturing agents Small sequence deletions and insertions can be 

1 o visualized by high resolution gel electrophoresis DNA fragments of different sequences may be distinguished on de- 
naturing formamide gradient geis m which the mobilities of different DNA fragments are retarded in the gel at different 
positions according to their specific melting or partial melting temperatures (see. e.g.. Myers et ai Science 1985 230 
1242). Sequence changes at specific locations may also be reveaied by nuclease protection assays such as RNase 
and S1 protection or the chemical cleavage method (e g Cotton et at PNAS USA 1985. 85. 4397-4401 

■5 Thus the detection of a specific DNA sequence may be achieved by methods such as hybridization RNase pro- 

tection, chemical cleavage direct DNA sequencing or the use of restriction enzymes, (e.g.. Restriction Fragment Length 
Polymorphisms (RFLP)) and Southern blotting of genomic DNA 

In addition to more conventional gel-electrophoresis and DNA sequencing, mutations can also be detected by in 
situ analysis 

20 The present invention also relates to a diagnostic assay for detecting altered levels of HC gp-39L protein in various 

tissues Assays used to detect levels of such protein in a sample derived from a host are well-known to those of skill 
in the art and include radioimmunoassays, competitive-binding assays Western Blot analysis and preferably an ELI SA 
assay An ELISA assay initially comprises preparing an antibody specific to a HC gp-39L protein, preferably a mono- 
clonal antibody. In addition a reporter antibody is prepared against the monoclonal antibody To the reporter antibody 

25 is attached a detectable reagent such as radioactivity, fluorescence or in this example a horseradish peroxidase en- 
zyme A sample is removed from a host and incubated on a solid support, eg. a polystyrene dish, that binds the 
proteins m the sample Any free protein binding sites on the dish are then covered by incubating with a non-specific 
protein like 3SA Next, the monoclonal antibody is incubated in the dish during which time the monoclonal antibodies 
attach to HC gp-39L protein attaches to the polystyrene dish All unbound monoclonal antibody is washed out with 

30 buffer The reporter antibody linked to horseradish peroxidase is now placed in the dish resulting m binding of the 
reporter antioody to any monoclonal antibody bound to HC gp-39L Unattached reporter antibody is then washed out 
Peroxidase substrates are then added to the dish and the amount of color developed in a given time period is a meas- 
urement of the amount of HC gp-39L protein present : n a given volume of patient sample when compared against a 
s r andard curve 

35 A competition assay may be employed wherein antibodies specific to HC gp-39L protein are attached to a solid 

support and labeled protein and a sample derived from the host are passed over the solid support and the amount of 
laoel detected attached to the solid support can be correlated to a quantity of HC gp-39L protein in the sample 
The following examples are provided for illustrative purposes only and are not intended to limit the invention 

+o EXAMPLES 

Example 1: Expression of HC gp-39 in vitro 

Recombinant HC gp-39 was produced in vitro by transfecting an expression vector containing the cDNA into CHO 
45 cells and selecting stable cell lines 

Tne full lengtn HC gp-39 gene was cloned into CDN in two pieces a 660 bp Sac ll-Bst Ell fragment plus a 673 bp 
Bst Ell -Bel fragment ngated together with the CDN vector cut with Sac 1 1 -Bel I. This construct was transacted into 
CHO ACC 317 Cells by standard methods Specifically. 20 mg of the HC gp-39 plasmid construct was linearized by 
restriction digestion and electroporated into 1 25 x 10 7 cell ;n 1 m! Cells were seeded at a density of 2 5 x 10 3 cells 
50 per well and selected in minimal media in the absence of nucleosides Secreted protein was recovered from the con- 
ditioned media and purified using Q sepharose flow through, S sepharose capture and sized on Suprose 12 The 
' esultmq material was greater than 95% Dure as determined by Coomassie blue staining HC gp-39L can also be 
expressed as described in this Example 1 

55 Example 2: Assay for smooth muscle migration 

Smooth muscle migration was measured in a chamber divided by a semi-permeable filter to which rHC gp-39 
[2 3-766 mg/ml) was bound on one side On the other side human fetal smooth muscle cells were cultured The extent 
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io which celis migrated into the filter was monitorea colonmetrically (optical density) rHC qp-39 elicited a stronger 
migration response in this assay than P DGF a known stimulator of smooth muscle migration This assay in this Example 
1 can aiso be readily used for analysis of HC gp-39L 

5 Example 3: Production of polyclonal antibodies generated against HC gp-39L 

A partial HC gp-39L protein has been expressed in £ coli and used to generate polyclonal antiserum A 1461 bp 
Ndel-Xhol cDNA fragment of HC gp-39L was cloned in frame as a fusion with an N-terminal His tag in the Pet 16B 
vector system (Novagen) These constructs were transformed into E colt through standard methods The cells were 
? o propagated lysed. and the protein purified by nickel affinity chromatography The purified fusion protein was used to 
immunize rabbits for the production of polyclonal antiserum 

Example 4: HC gp-39L is associated with the membrane fraction of lymphocytes 

is Polyclonal antiserum was used to detect HC gp-39L protein on Western blots Protein is detected in whole cell 

lysates of lymphocytes It is also detected in the membrane fraction but is not associated with the cytoplasmic fraction 
or secreted into the media HC gp-39L may function as a eel! surface lymphocyte marker for a subset of activated 
lymphocytes (inflammatory or tissue remodeling diseases} and be detected by FACS analysis, a secondary detection 
method known to those of skill in the art 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Kirkpatrick, Robert 
Rosenberg, Martin 

(n) TITLE OF THE INVENTION: Human Cartilage Glycoprotein 

(in) NUMBER OF SEQUENCES : 2 

(iv) CORRESPONDENCE ADERESS : 

'A) ADDRESSEE: SmithKline Beecham Corporation 

;B) STREET: 7 09 Swedeland Road 

{C) CITY: King of Prussia 

(D) STATE : PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
<C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/016,532 
(3) FILING DATE: 03-MAY-1996 



(viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: Han, William T 

(B) REGISTRATION NUMBER: 5219 

(C) REFERENCE/ DOCKET NUMBER: P503 90 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-5219 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAGCACCTGT GGCTGGGGAG CCCAGATGAA GTGTGGCTCT ATCTTGTATG TGAGCACACC 6 0 

CACATTTTCA CTGCCATTAT CTGGGACAGC AGAACCAGGT TTGGCTCAAC AGATTTCTCT 12 0 

TTCCACCCAT CTATTGCAGG TGTAGTGGTC TTGCTGCTTC TCCAGGGAGG ATCTGCCTAC 180 

AAACTGGTTT GCTACTTTAC CAACTGGTCC CAGGACCGGC AGGAACCAGG AAAATTCACC 24 0 

CCTGAGAATA TTGACCCCTT CCTATGCTCT CATCTCATCT ATTCATTCGC CAGCATCGAA 300 

AACAACAAGG TTATCATCAA GGACAAGAGT GAAGTGATGC TCTACCAGAC CATCAACAGT 360 
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CTCAAAACCA 
TCCAAAGGGT 
ATAATCCTGT 
GATCAGAAAG 
AAGGACTTCA 
AGGCAAATGA 
AACCTCCTGT 
AGCCCTCTGA 
GCTGTGGGGT 
ACATATGGGC 
GGCCCTGGAG 
TGCCAGTTCC 
GTCAAGGGGA 
TTCTTAAAGA 
ACTGGCAAAT 
GGCTCCCTGT 
CTGCTCTCTC 
TCTTCCAAGC 
GCAGCTGTTG 



AGAATCCCAA 

TCCACCCTAT 

TTCTGAGGAA 

AAAACACTCA 

CAAAATCCAC 

TTGATAACAG 
CCTTTGACTT 

GCAAGGGGTG 
ACTGGATACA 
ACTCCTTCAC 
CTGCTGGACC 
TGAAAGGAGC 
ACCAGTGGGT 
ATTTAAACCT 
CCTGCAACCA 
GAAGGATTAA 
CCAGGAATTC 
CTTTCCTGAC 
ACTTGTTGCC 



ACTGAAAATT 
GGTGGATTCT 
CCATAACTTT 
TTTCACTGTG 
CAAGGAAAGG 
CTATCAAGTT 
CCATGGGTCT 
GCAGGACAGA 
TAAGGGAATG 
ACTGGCCTCT 
CATCACAGAG 
CAAGATCACG 
GGGCTATGAT 
GGGAGGAGCC 
GGGCCCTTAC 
CTTACAGAGA 
TCATGTGGGA 
TTCCTCTTAG 
CTGAAGTACA 



CTCTTGTCCA 
TCTACATCAC 
GATGGACTGG 
CTGATTCATG 
CTTCTCTTGA 
GAGAAACTGG 
TGGGAAAAGC 
GGGCCAAGCT 
CCATCAGAGA 
GCAGAAACCA 
TCTTCAGGCT 
CGGCTCCAGG 
GATGTGAAGA 
ATGATCTGGT 
CCTCTTGTCC 
AGCAGGCAAG 
TTCCCCTTGC 
ATCATAGATT 
ATAAAAAAAA 



TTGGAGGGTA 
GCTTGGAATT 
ATGTAAGCTG 
AGTTAGCAGA 
CTGCGGGCGT 
CAAAAGATCT 
CCCTTATCAC 
CCTACTACAA 
AGGTGGTCAT 
CCGTGGGGGC 
TCCTGGCCTA 
ATCAGCAGGT 
GTATGGAGAC 
CTATTGACAT 
AAGCAGTCAA 
ATGACCTTGC 
CAGGATGGCC 
GGACCTGGTT 
TTCATTTTGC 



CCTGTTTGGT 
CATTAACTCC 
GATCTACCCA 
AGCCTTTCAG 
ATCTGCAGGG 
GGATTTCATC 
TGGCCACAAC 
TGTGGAATAT 
GGGCATCCCC 
CCCTGCCTCT 
TTATGAGATC 
TCCC7ACGCA 
CAAGGTTCAG 
GGATGACTTC 
GAGAAGCCTT 
TGCCTGGGGC 
TTTGGATCTC 
TTGTTTTCCT 
TCCAGT 



420 
4G0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1496 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 423 amino acids 
( E ) TYPE: airino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
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Arg 
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Gly 


Ser 
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Ser 
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Pro 


Ser 


He 


Ala 


Gly 


Val 
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Val 


Val 
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Gly 
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Ser 


Ala 


Tyr 
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Val 


Cys 
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60 










Tyr 


Phe 


Thr 


Asn 


Trp 


Ser 


Gin 


Asp 


Arg 


Gin 


Glu 


Pro 


Gly 


Lys 


Phe 


Thr 


65 










70 










75 










80 


Pro 


Glu 
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He 
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Pro 
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He 


Tyr 


Ser 
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Ser 


Glu 


Val 








100 
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Trp 


Val 


Gly 




355 










36C 
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Claims 

1. A nucleic acid sequence encoding HC gp-39L 

2. A protein encoded by the a nucleic acid sequence comprising the sequence of claim 1 . 

3. A nucleic acid sequence comprising the sequence of SEQ ID NO: 1 

4. A metnod of diagnosing a tissue remodeling disorder related to expression of a mutated HC gp-39L protein in a 
host comprising 

(a) synthesizing PCR primers complementary to a nucleic acid encoding a normal HC gp-39L protein, 
(bj obtaining nucleic acids from celis of a host: 

(c) amplifying the obtained nucleic acids with the synthesized PCR primers to obtains an amplified product: and 

(d) comparing the amplified product to a normal genotype encoding the normal HC gp-39L protein to detect 
any mutations in the amplified product 

5. The method of claim 4 wherein the HC gp-39L comprises the polypeptide sequence of SEQ ID NO 1 

6. A method of detecting altered expression of a HC gp-39L protein in a host comprising: 

(a) preparing a first antibody specific to a HC gp-39L protein: 

(b) preparing a reporter antibody against the first antibody said reported antibody linked to an attached de- 
tectable reagent. 

(c) obtaining a sample from a host 

(d) incubating said sample on a solid suoport which binds any proteins in said sample 

fei incubating said first antibody with any bound protein on said solid support so that said first antibody attaches 
to any HC gp-39L proteins bound to said soiid support 

if) contacting said reporter antibody linked to a detectable reagent with said attached first antibody on said 
soiid support. 

(g) detecting said detectable reagent: and 

(h) determining an amount ot HC gp-39L protein m said sample 

7. The method of claim 6 wherein the HC gp-39L comprises the polypeptide sequence of SEQ ID NO 1 
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